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Abstract 

This paper discusses a special type of multi-user communication scenario, in which users' utilities 
are linearly impacted by their competitors' actions. First, we explicitly characterize the Nash equilibrium 
and Pareto boundary of the achievable utility region. Second, the price of anarchy incurred by the 
non-collaborative Nash strategy is quantified. Third, to improve the performance in the non-cooperative 
scenarios, we investigate the properties of an alternative solution concept named conjectural equiUbrium, 
in which individual users compensate for their lack of information by forming internal beliefs about 
their competitors. The global convergence of the best response and Jacobi update dynamics that achieve 
various conjectural equilibria are analyzed. It is shown that the Pareto boundaries of the investigated 
linearly coupled games can be sustained as stable conjectural equilibria if the belief functions are properly 
initialized. The investigated models apply to a variety of realistic applications encountered in the multiple 
access design, including wireless random access and flow control. 

Index Terms 

Nash equilibrium, Pareto-optimaUty, conjectural equilibrium, non-cooperative games. 

I. Introduction 

Game theory provides a formal framework for studying the interactions of strategic agents. Recently, 
there has been a surge in research activities that employ game theory to model and analyze a wide range 
of application scenarios in modern communication networks [I]- [4]. In communication networks, any 
action taken by a single user usually affects the utilities of the other users sharing the same resources. 
Depending on the characteristics of different applications, numerous game-theoretical models and solution 
concepts have been proposed to describe the multi-user interactions and optimize the users' decisions in 
communication networks. Roughly speaking, the existing multi-user research can be categorized into two 
types, non-cooperative games and cooperative games. Various game theoretic solutions were developed 
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to characterize the resulting performance of the multi-user interaction, including the Nash Equihbrium 
(NE) and the Pareto-optimality [18]. 

Non-cooperative approaches generally assume that the participating users simply choose actions to 
selfishly maximize their individual utility functions. It is well-known that if devices operate in a non- 
cooperative manner, this will generally limit their performance as well as that of the whole system, because 
the available resources are not always efficiently exploited due to the conflicts of interest occurring among 
users [5]. Most non-cooperative approaches are devoted to investigating the existence and properties of 
the NE. In particular, several non-cooperative game models, such as S-modular games, congestion games, 
and potential games, have been extensively apphed in various communication scenarios [6]- [9]. The price 
of anarchy, a measure of how good the system performance is when users play selfishly and reach the 
NE instead of playing to achieve the social optimum, has also been addressed in several cormnunication 
network applications [10] [11]. 

On the other hand, cooperative approaches in communication theory usually focus on studying how 
users can jointly improve their performance when they cooperate. For example, the users may optimize a 
common objective function, which represents the Pareto-optimal social welfare allocation rule based on 
which the system- wide resource allocation is performed [12] [13]. A profile of actions is Pareto-optimal 
if there is no other profile of actions that makes every player at least as well off and at least one player 
strictly better off. Allocation rules, e.g. network utility maximization, can provide reasonable allocation 
outcomes by considering the trade-off between fairness and efficiency. Most cooperative approaches focus 
on studying how to efficiently find the optimum joint policy. It is worth mentioning that information 
exchanges among users is generally required to enable users to coordinate in order to achieve and sustain 
Pareto-efficient outcomes. 

In this paper, we present a game model for a particular type of non-cooperative multi-user communi- 
cation scenario. We name it linearly coupled communication games, because users' utihties are linearly 
impacted by their competitors' actions. In particular, the main contributions of this paper are as follows. 
First, based on the assumptions that we make about the properties of users' utihty, we characterize 
the inherent structures of the utility functions for the hnearly coupled games. Furthermore, based on 
the derived utihty forms, we explicitly quantify the NE and Pareto boundary for the hnearly coupled 
cormnunication games. The price of anarchy incurred by the selfish users playing the Nash strategy is 
quantified. In addition, to improve the performance in the non-cooperative scenarios, we investigate an 
alternative solution: conjectural equilibrium (CE). Using this approach, individual users are modeled as 
behef-forming agents that develop internal behefs about their competitors and behave optimally with 
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respect to their individual beliefs. Necessary and sufficient conditions that guarantee the convergence of 
different dynamic update mechanisms, including the best response and Jacobi update, are addressed. We 
prove that these adjustment processes based on conjectures and non-cooperative individual optimization 
can be globally driven to Pareto-optimality in the linearly coupled games without the need of real-time 
coordination information exchange among agents. 

The rest of this paper is organized as follows. Section II defines the linearly coupled communication 
games. For the investigated game models. Section m explicitly computes the NE and Pareto boundary 
of the achievable utility region and quantifies the price of anarchy. Section TV introduces the CE and 
investigates its properties under both the best response and Jacobi update dynamics. Conclusions are 
drawn in Section V. 

II. Game Model 

In this section, we first provide a general game-theoretic formulation of the multi-user interaction in 
communication systems. Following the proposed definition, we define the linearly coupled communication 
games and provide concrete examples of the investigated game model. 

A. Linearly Coupled Communication Games 

The multi-user game in various communication scenarios can be formally defined as a tuple F = 
{J\f,A,u,S,s). In particular, Af = {1,2, .. . ,N} is the set of communication devices, which are the 
rational decision-makers in the system. Define A to be the joint action space .4 = x neAfAn, with An 
being the action set available for user n. As opposed to the traditional strategic game definition [18], 
we introduce two new elements S and s into the game formulation. Specifically, S is the state space 
•S = ^neAf^n, where Sn ^ is the part of the state relevant to user n. The state is defined to 
capture the effects of the multi-user coupling such that each user's utility solely depends on its own state 
and action. In other words, the utihty function u = XneAfUn is a mapping from the individual users' 
state space and action space to real numbers, Un '■ Sn x An Tl- The state determination function 
s = XneAfSn maps joint actions to states for each component s„ : .A ^ >S„. To capture the performance 
tradeoff, the utility region is defined asU = {(ni(a), . . . , 'UAr(a))| 3 a = (ai, 02, . . . , ajv) £ A}. 

Definition 1: A multi-user interaction is considered a linearly coupled communication game if the 
action set An C 7?.+ is convex and the utility function u„ satisfies: 

Un(a) = a^" • s„(a), (1) 
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in which /3„ > 0. In particular, the basic assumptions about Sr!,(a) include: 
Al: Sn(a) is non-negative; 

A2: Denote s'^^{a.) = ^^^^ and s'^mi^) = Sn{sL) is strictly linear decreasing in amj^rn ^ n, 

i.e. sJim(a) < and s^„(a) = 0; s„(a) is non-increasing and linear in a„, i.e. s'^„i^) — ^'^'i ^nn(^) = 0- 
A3: ^f"^^^^ is an affine function, Vn G \ {m}. 

A4: <^ = <^,Vn,A: eA/'\{m}; <^ = or Vn / m. 

Assumptions Al and A2 indicate that increasing for any m ^ n within the domain of s„(a) will 
linearly decrease user n's utility. Assumptions A3 and A4 imply that a user's action has proportionally 
the same impact over the other users' utility. The structure of the utility functions that satisfy assumptions 
A1-A4 will be addressed in Section III. 



B. Illustrative Examples 

There are a number of multi-user communication scenarios that can be modeled as linearly coupled 
communication games. For example, in the random access scenario [15], the action of a node is to 
select its transmission probability and a node n will independently attempt transmission of a packet with 
transmit probability p„. The action set available to node n is An = [0, 1] for all n E N. In this case, the 
utility function is defined as 

As an additional example, in flow control [16], N Poisson streams of packets are serviced by a single 
exponential server with departure rate /i and each class can adjust its throughput r„. The utility function 
is defined as the weighted ratio of the throughput over the average experienced delay: 

N 

Un{r) = r^" ■ {fi- ^rm), (3) 

m=l 

in which /?„ > is interpreted as the weighting factor. Specifically, we can see that the state determination 
functions are s„(p) = l\m€Af\{n}i'^ - Pm) in © and s„(r) = fi- J2Z=i '^m in ©. It is straightforward 
to verify that these functions satisfy assumptions A1-A4 for both ^ and 

In this paper, we are interested in comparing the achievable performance attained by different game- 
theoretic solution concepts. On one hand, it is well-known that NE is generally inefficient in com- 
munication games [17], but it may not require explicit message exchanges, while Pareto-optimality can 
usually be achieved only by exchanging implicit or explicit coordination messages among the participating 
users. On the other hand, in several recent works [14] [15], we have applied an alternative solution 
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in different communication scenarios to improve the system performance in non-cooperative settings, 
namely the conjectural equilibrium [21]. The following sections aim to compare the solutions of NE, 
Pareto boundary, and CE in terms of the payoffs and informational requirements in the linearly coupled 
multi-user interaction satisfying the assumptions A1-A4. 

III. Computation of the Nash Equilibrium and Pareto Boundary for Linearly Coupled 



In this section, we show that the computation of the NE and the Pareto boundary in linearly coupled 
games is equivalent to solving linear equations. Specifically, we investigate the inherent structures of the 
utility functions satisfying assumptions A1-A4 and define two basic types of linearly coupled games. The 
performance loss incurred by the Nash strategy are quantified for Type II games. 

A. Nash Equilibrium 

In non-cooperative games, the participating users simply choose actions to selfishly maximize their 
individual utility functions. The steady state outcome of such interactions is an operating point, at which 
given the other users' actions, no user can increase its utility alone by unilaterally changing its action. 
This operating point is known as the Nash equilibrium, which is formally defined below [18]. 

Definition 2: A profile a of actions constitutes a Nash equilibrium of T if Un{an, a_„) > ii„(a[j, a_„) 
for all a'„ E An and n € J\f. 

We are interested in computing the NE in the linear coupled games. From equation ([T]), we have 



On one hand, if s[i„(a) = 0, Vn € J\f, since user n's utility function strictly increases in a„, we have 
trivial NE at which a* is the maximal element in An that lies in the domain of s(-), Vn € J\f. 

On the other hand, if s^„(a) / 0,Vn G J\f, according to assumption A3, since the multi-user 
interactions are linearly coupled, we have 



Games 




(4) 



Sn(a) = C(a_^)+5r(a-™K 



(5) 



where /™(a_m), ^^^(a-m) both polynomials and (7"(a_„) / 0. From this, it follows 



(6) 
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At NE, we have 

51og[ti„(a)] 



da. 



0,VnGA/'. (7) 



n 



Under assumption A3 and A4, is a affine function, which enables us to expUcitly characterize 

the NE. Denote = /i„(a_„). Equation d?]) can be rewritten as 

) + (/3„ + l)-a„ = 0,VnGA/'. (8) 

Therefore, the solutions of Equations ([U are the NE of the linearly coupled games and computing the 
NE is equivalent to solving A^-dimension linear equations. The following theorem indicates the inherent 
structure of the utility functions {un}n=i when the requirements A1-A3 are satisfied. 

Theorem 1: Under assumptions A1-A3, the irreducible factors of s„(a) over the integers are affine 
functions and have no variables in common. 

Proof: Denote the factorization of s„(a) as 



s„(a) = JJ^Ua), (9) 



i=l 

in which M„ represents the number of the non-constant irreducible factors in s„(a). Define V(-) as the 
mapping from a polynomial to the set of variables that appear in that polynomial. Based on assumption 
A2, we immediately have 

V(6i(a)) n V(6{(a)) = 0,Vi,j(i / i),n. 

Without loss of generality, we assume that aj € V(&,^(a)) and bl^{a.) = f^i (a-j) +5^1 (a_j)aj. Then 
fi{a^j),gi{a^j) in © are given by 

M,. M„ 

/n(a-i) = fbii^-j) • n^"(^)' ^"'^ 5n(a-i) = 5bja_.j) • Jj6;,(a). 

1=2 i=2 

Therefore, /'^~"'\ = — 4- By assumption A3, we have that the degree of ^!-^, — 4 is less than or 
equal to 1. Since ^^'^(a) is irreducible, we can conclude that g^^ (a_j) is a constant and the degree of 
fli (a_j) is less than or equal to 1. Note that the arguments above hold, Vj, n. Therefore, the degree of 
6^ (a) is one, Vn € J\f, i = 1, • • • , M„, which concludes the proof. ■ 



B. Pareto Boundary 

Since log(-) is concave and log[u„(a)] is a composition of affine functions [19], nn(a) is log-concave 
in a and the log-utility region \ogU is convex. Therefore, we can characterize the Pareto boundary of 
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the utiUty region as a set of a optimizing the following weighted proportional fairness objectiveLl: 

N 

maxV] a;„log[n„(a)], (10) 



a 

n=l 



for all possible sets of {ujn} satisfying w„ > and Yln=i — 1- Denote the optimal solution of problem 
([Tol l as a^^, which satisfies the following first-order condition: 



0,\/neAf, (11) 



dan 

Under assumptions A1-A3, the LHS of equation ([TT]) can be rewritten as 

= \ + } ^ i^k (12) 



By Theorem [T] and assumption A4, we have 

^ VA:GAA\{m}, (13) 



4m (a) _ 1 



in which V'm(a) is a affine function. Therefore, equation (fT2l ) is equivalent to 



^^^=1^^^ log[nfc(a)] 



Pmi-^m/am + (1 " a;m)/V'm(a), i/ Smm(a) = 0; 
/JmWm/am, + l/^/;m(a), otherwise. 



(14) 



We can compute the Pareto boundary of the linearly coupled games by solving linear equations: 

C>Y.k=l^k logK(a)] _ J /3mCJ„Vm(a) + (1 - UJm)am = 0, if S'mmi^) = 0; 

[ /3ma;mV'm(a) + am = 0, otherwise. 

Theorem [U reveals the structural properties of the utility functions {un}n=i when assumption A1-A3 
are satisfied. Based on Theorem [T] the following theorem further refines these properties of {un}n=i 
when the additional assumption A4 is imposed. 

Theorem 2: Under assumptions A1-A4, for any polynomial 6^ (a) in the factorization s„(a) = nf=i ^n(a)' 
Vn G M, if |V(6^(a))| > 2 or V(&*„(a)) = {a„}, 6*„(a) is an irreducible factor of Sm(a), Vm e AA; if 
V(6^(a)) = {am},rn 7^ n, h\^{a) is an irreducible factor of Sj(a), Vj G AA/{m}. 

Proof: By assumption A2, s^^(a) < 0,Vm, 7^ n, we have |V(s„(a))| > N — l,\/n G N. By 
Theorem [H the irreducible factors of s„(a) have no common variables and they are affine functions. 
Suppose |V(6^(a))| > 2 and {am,ai} G V(6*„(a). By assumption A4, we know that ^fj^ = ^f^^ = 
V"(a)^ , Vn, k ^ J\f\ {m}. Therefore, it follows 

'Note that the utility region U is not necessarily convex. Therefore, its Pareto boundary may not be characterized by the 
weighted sum of {u„(a)}^^i. 
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Since 6^„(a) is a constant, we can see that 6^ (a) is an irreducible factor of Sfc(a), Vfc € \ {m}. 
By symmetry, we can conclude that 6^(a) must also be an irreducible factor of Sfe(a), \/k J\f\ {I}. 
Therefore, 6^ (a) is an irreducible factor of Sfc(a), Vfc G Af. Similarly, we can prove the remaining parts 
of Theorem |2l ■ 

Remark 1: For the linearly coupled games satisfying assumptions A1-A4, suppose we factorize all 
users' state functions. Theorem |2] indicates that any factor with at least two variables must be a common 
factor of all the users' state functions, and any factor with a single variable must be a common factor 
of state functions for users excluding k. In reality, it corresponds to the communication scenarios in 
which the state, i.e. the multi-user coupling, is impacted by a set of users that result in a similar signal 
to all the users. 

We define two basic types of linearly coupled games satisfying the assumptions A1-A4. In Type I 
games, user A;'s action linearly decreases all the users' states but itself. Hence, the utility functions take 
the form 

Un{a) = a^" ■ Y[ ifJ-m - Tmam)- (17) 

In Type II games, all the users share the same non-factorizable state function and their utility functions 
are given by 

N 

Un{a) = a^" • (/X - ^ r„,am)- (18) 

m=l 

As special examples, the random access problem in ^ belongs to Type I games and the rate control 
problem in ([3]l belongs to Type II games. In fact, all the games that have the properties A1-A4 can be 
viewed as compositions of these two basic types of games (See the example in Remark [T]). Therefore, 
investigating the two basic types provides us the fundamental understanding of the linearly coupled multi- 
user interaction. A brief summary of the properties of Type I games will be provided in Section IV.E. 
For the details about its various game-theoretic solutions, we refer the readers to [15] and the references 
therein. The rest of this paper will focus on Type II games. 

C. Nash Equilibrium and Pareto Boundary in Type II Games 
For Type II games with utility functions given in ([TSl l. we have 

^nn (^) _ ~'^n 

Therefore, Equation ^ can be reduced to 

(1 + /3n)T„a„ + /?n ^ Tjnam = /^nM, Vn G M 



(19) 
(20) 
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The solution of the linear equations gives the NE, and its closed form has been addressed in [22] for 
Tn = l,Vn G AA. For the general case, it is easy to verify that the NE is given by 



r = — ,VneAA. (21) 



Similarly, to compute the Pareto boundary of Type II games. Equation ([T4l) can be reduced to 

(1 + a;„/?„)T„a„ + UnPn ^ T^Om = w„/3„/U, Vn G J\f. (22) 

The solution is given by 

= n —,\/neM. (23) 

From Section II.B, we know that the region logU is convex. Therefore, we can compare the efficiency 
of and a^^ using the system-utility metric X]^=i '^n ^°s[^n(^)]- Specifically, we have 

n=l """^^ ^ n=l Wn(l + Ei=l/?j) 1 + Ei=l /^j 

Denote t^o = 1, xq = i , v^Tv' ;L \ it'n = and x„ = — 7T-f;V-^, Vn e AA. Therefore, 



n=l n=l n=0 n=0 

Using the inequalities among the arithmetic, geometric and harmonic means [24], we have 



(1 + En=l ^^/?n)(l + E:=1 /3n) E„=0 " n=0 " E 



(25) 



(26) 



n=0 



Both inequalities hold with equality if and only if xq = xi = . . . = xat, i.e. = . . . = wat = 1. 
However, since we require E^=i = 1, (l26l ) holds as strict inequalities, which leads to 

n=l (l + En=l'^n/3n)(l + En=l/5n) „=i ^"1^ j 

Based on Equation (|27] ). we can make two important observations. First, due to the lack of coordination, 
the NE in Type II games is always strictly Pareto inefficient. Second, as opposed to Type I games where 
NE may result in zero utility for certain users [15], the efficiency loss in Type II games are lower bounded, 
which means that every user receives positive payoff at NE. Noticing that the performance gap between 
ti„(a^^) and nn(a^^) is non-zero, we will investigate how the non-cooperative CE solution can improve 
the system performance for Type II games. 
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IV. Conjectural Equilibrium for the Linearly Coupled Games 

A. Definitions 

In game-theoretic analysis, conclusions about the reached equiUbria are based on assumptions about 
what knowledge the players possess. For example, the standard NE strategy assumes that every player 
believes that the other players' actions will not change at NE. Therefore, it chooses to myopically 
maximize its immediate payoff [18]. Therefore, the players operating at equiUbrium can be viewed as 
decision makers behaving optimally with respect to their beliefs about the strategies of other players. 

To avoid detrimental Nash strategy and encourage cooperation, the conjecture-based model has been 
introduced by Wellman and others [20] [21] to enable non-cooperative players to build belief models 
about how their competitors' reactions vary in response to their own action changes. Specifically, each 
player has some belief about the state that would result from performing its available actions. The belief 
function Sn is defined to be s„ : An —>■ >5„ such that s„(a„) represents the state that player n beheves it 
would result in if it selects action a„ . Notice that the beliefs are not expressed in terms of other players' 
actions and preferences, and the multi-user coupling in these beliefs is captured indirectly by individual 
players forming conjectures of the effects of their own actions. By deploying such a behavior model, 
players will no longer adopt myopic behaviors that do not forecast s„, but rather they will form behefs 
Snicin) about how their actions a„ will influence the aggregate effects s„ incurred by their competitors' 
responses and, based on these beliefs, they will choose the action a„ G An if it believes that this action 
will maximize its utility. The steady state of such a play among belief-forming agents can be characterized 
as a conjectural equilibria. 

Definition 3: In the game T, a configuration of behef functions (s^, . . . , s^) and a joint action a* = 
(o*, . . . , a^) constitute a conjectural equilibrium, if for each n E Af, 

Sn(«n) = Sn{al, • • • , Oat) and a* = arg max (a„), a„). 

an&An 

From the above definition, we can see that, at CE, all players' expectations based on their beliefs are 
reaUzed and each agent behaves optimally according to its expectation. In other words, agents' behefs 
are consistent with the outcome of the play and they use "conjectured best responses" in their individual 
optimization program. The key challenges are how to configure the belief functions such that cooperation 
can be sustained in such a non-cooperative setting and how to design the evolution rules such that the 
communication system can dynamically converge to a CE having satisfactory performance. 
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B. Linear Beliefs 

As discussed before, the belief functions need to be defined in order to investigate the existence of CE. 
To define the belief functions, we need to express agent n's expected state s„ as a function of its own 
action a„. The simplest approach is to design linear belief models for each user, i.e. player n's belief 
function takes the form 

Sn(an) = Sn - A„(a„ - a„), (28) 

for n € N. The values of Sn and a„ are specific states and actions, called reference points and A„ is 
a positive scalar. In other words, user n assumes that other players will observe its deviation from its 
reference point a„ and the aggregate state deviates from the reference point s„ by a quantity proportional 
to the deviation of a„ — a„. How to configure s„, a„, and A„ will be addressed in the rest of this paper. 
We focus on the linear belief represented in ( [28] ). because this simple belief form is sufficient to drive 
the resulting non-cooperative equilibrium to the Pareto boundary. 

The goal of user n is to maximize its expected utility On" • Sn{an) taking into account the conjectures 
that it has made about the other users. Therefore, the optimization a user needs to solve becomes: 



max 



Ara(ffln ~ On) 



(29) 



For Afe > 0, user n believes that increasing an will further reduce its conjectured state Sn- The optimal 
solution of (|29l ) is given by 

A„(l + /3„) • ^■'"^ 



In the following, we first show that forming simple linear beliefs in (1281) can cause all the operating 
points in the achievable utility region to be CE. 

Theorem 3: For Type II games, all the positive operating points in the utility region U are essentially 
CE. 

Proof: For each positive operating point (nj, . . . (i.e. n* > 0, Vn € J\f) in the utility region U, 
there exists at least one joint action profile (a*, . . . ,a^) € A such that u* = u„(a*), Vn € M. We 
consider setting the parameters in the belief functions {sn{an)}n=i to be: 

A; = /3„. ^ ^"^=1 VngAA. (31) 



It is easy to check that, if the reference points are s„ = /u — J2m=i '^mO-m^ = ^n' have Snia^) = 
Sn{al, . . . , a^) and a* = arg raaXa^eAr, Un{sn{an),an)- Therefore, this belief function configuration and 
the joint action a* = (aj, . . . , a*^) constitute the CE that results in the utility {u\, . . . , u^). ■ 
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Theorem [3] establishes the existence of CE, i.e. for a particular a* € A, how to choose the parameters 
{sn,an, \n}n=i ^uch that a* is a CE. However, it neither tells us how these CE can be achieved and 
sustained in the dynamic setting nor clarifies how different belief configurations can lead to various CE. 

We consider the dynamic scenarios in which users revise their reference points based on their past 
local observations over time. Let s^, a^, sf^, s^, be user n's state, action, belief function, and reference 
points at stage t, in which = — J2m=i '^mO-m- propose a simple rule for individual users to 
update their reference points. At stage t, user n sets its and to be and a^~^. In other words, 
user n's conjectured utility function at stage t is 



N 

' ^ ] (32) 

m=l 

Since we have defined the users' utility function at stage t, upon specifying the rule of how user n updates 
its action based on its utility function ufj(s^(a„), a„), the trajectory of the entire dynamic process 
is determined. The remainder of this paper will investigate the dynamic properties of the best response 
and Jacobi update mechanisms and the performance trade-off among the competing users at the resulting 
steady-state CE. In particular, for fixed {A„}^^^, Section IV-C derives necessary and sufficient conditions 
for the convergence of the best response and the Jacobi update dynamics. Section IV-D quantitatively 
describes the limiting CE for given {A^}^^^ and investigates how the parameters {A„}^^^ should be 
properly chosen such that Pareto efficiency can be achieved. 

C. Dynamic Algorithms 

1 ) Best Response: In the best response algorithm, each user updates its action using the best response 
that maximizes its conjectured utility function in ( [32l ). Therefore, at stage t, user n chooses its action 
according to 



We are interested in characterizing the convergence of the update mechanism defined by (1331) when using 
various An to initialize the belief function s„. 

To analyze the convergence of the best response dynamics, we consider the Jacobian matrix of the 
self-mapping function in (l33l) . Let Jj^ denote the element at row i and column k of the Jacobian matrix 
J. The elements of the Jacobian matrix J^^ of (l33l) are defined as: 

RR da' f ¥utPy^ if i = k, 
^ I A,(l+/3.)' " ^ ^ 
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For Type II games, the following theorem gives a necessary and sufficient condition under which the 
best response dynamics defined in (1331 ) converges. 

Theorem 4: For Type II games, a necessary and sufficient condition for the best response dynamics 
to converge is 



N 

E 



< 1. 



(35) 



^^A„(l + 2/3„) 

Proof: The best response dynamics converges if and only if the eigenvalues of the Jacobian 

matrix J^^ in (l34l ) are all inside the unit circle of the complex plane [25], i.e. \S,n^'\ < 1, Vn € M. To 
determine the eigenvalues of J^^, we have 



det(e/-J 



BR\ 



_ /3i(Ai-ri) 
Ai(l+/3i) 



/3jvTl 

An(1+/3n) 

_ /3i(Ai-Ti) 
Ai(l+/3i) 

/32T1 



A2(l+/32) 
/3]VTl 

Ajv(l+/3«) 
/3 



Ai(l+/3i) 

_ A(A2-T2) 
A2(l+/32) 



/?WT2 

Ajv(l+/3jv) 

/32 



22. 

Tl V 1+/3; 



1+/32 



N 



Ai(l+/3i) 

A2(l+/32) 
/3w(AjV— Tjv) 

A„(l+/3„) 



Tl U+/3; 



i+/3i) ■ Sn=i wiri+s 



0- 







/3iv 
l+/3iv 



1+/32 



A«(l+/3jv) 

Therefore, we can see that, the eigenvalues of J^^ are the roots of 



N 



N 



n=l ^ri\^ /3„ 



1 



fin 



n=l 



l + /3n 



)=0. 



(36) 



Denote q{d = Yl,n=i ITiT^f+aZTT- First, we assume that (3i / j- Without loss of generality, con- 

sider (3i < {32 < ■ ■ ■ < Pn- In this case, the eigenvalues of J^^ are the roots of q{S^) = 1. Note that q{^) 
is a continuous function and it strictly increases in (— cxd, j^^), (t+^; T+/3^)' ' ' ' ' ( i+fe^ i ' T+f^)' ^^'^ 



( -if^ , +oo). We also have lim^ ^ g(^) = +cx), liiiif / 3„ (?(^) = — oo, n = 1, 2, • • • , A^, and 

limg__oog(0 = Iim5^+oo9(0 = 0. Therefore, the roots of q{0 = 1 lie in (-00, j^), {j^, j^), 
■ • • . (t+I^' T+f^)- '^i"'^^ l(0 strictly increases in (-00, jf^), we have l^^^^l < 1, Vn G if and 



-/3jv-i' 1+/3 

only if q{—l 



A„(l+2/3„) 



< 1. 
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Second, we consider the cases in which there exists /3j = (3j for certain i, j. Suppose that {Pn}n=i t^^e 
K discrete values /ti, • • • ,kk and the number of {j3n}n=i that equal to Kk is Uk- In this case, Equation 
( l36b is reduced to 

TV K 

\y ll •TT(e-T^)"'=0- (37) 

Hence, equation g(^) = 1 has N + K — J2k=i "-fc roots in total, and ^ = is a root of multiplicity 
rifc — 1 for Equation ( [371 ). V/c. All these roots are the eigenvalues of matrix J^^. Similarly, the roots of 
q{0 = 1 lie in (-oo, (^^^, A necessary and sufficient condition 

under which \^^^\ < 1, Vn G A/" is still q{-l) < 1, i.e. J2n=i xji+w^) < 1" " 

Remark 2: Theorem |4] indicates that, if the condition in (1351 ) is satisfied, the best response dynamics 
converges linearly to the CE. The convergence rate is mainly determined by max„g^ |^^^|. Suppose 
Pi < l32 < ■ ■ ■ < Pn and ^ < ^ < • • • < From the proof of Theorem IH we can see 

that, under condition m, -1 < ^ < ll^ < ^2'''' < • ' ' < ^n""' and < < 

Therefore, the rate of convergence can be approximated by max{[^^^|, [^^^|}. Note that choosing 
larger {A„}^^^ increases ^i^. Hence, if —1 < < — l^^^^l, increasing {A„}^^^, i.e. having more 
self-constraint users, accelerate the convergence rate of the best response mechanism. On the other hand, 
since > , ^^'^ , the convergence rate is lower bounded by , ^^'^ ■ Therefore, if more than two 
users associate large weighting factors /? with their individual actions in the utility functions, we have 
-1^;^^"^ ^ — i> 1 and the best response dynamics converges slowly. 

Remark 3: Theorem |4] generalizes the necessary and sufficient condition derived in [22], where users 
are assumed to be symmetric, i.e. r„ = 1, Vn and they adopt the Nash strategy by choosing A„ = t„, Vn. 
Due to lack of symmetry, the derivation in [22] is not readily applicable to analyze the convergence of 
the best response dynamics. The proof of Theorem |4l instead directly characterizes the eigenvalues of 
the Jacobian matrix, and hence, provides a more general convergence analysis of the dynamic algorithms 
that allow users to update their actions based on their independent linear conjectures. 

Remark 4: In Type II games, a locally stable CE is also globally convergent, which is purely due to 
the property of its utility functions specified in ( [TSl l. From ([34] ). we can see that all the elements in J^^ 
are independent of the joint play a*~^. This is in contrast with Type I games considered in [15], where 
local stabihty of a CE may not imply its global convergence and the best response dynamics may only 
converge if the operating point is close enough to the steady-state equilibrium. 

2) Jacobi Update: We consider another alternative strategy update mechanism called Jacobi update 
[23]. In Jacobi update, every user adjusts its action gradually towards the best response strategy. At stage 
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t, user n chooses its action according to 

ai = J„(a*-i) := aJr' + e[5n(a*-^) - a''^], (38) 

in which the stepsize e > and i?„(a*^^) is defined in ( [33l ). The following theorem establishes the 
convergence property of the Jacobi update dynamics. 

Theorem 5: In Type II games, for given {r„, /3„, \n}^^i, the Jacobi update dynamics converges if the 
stepsize e is sufficiently small. 

Proof: The Jacobian matrix i"^^ of the self-mapping function (|38] ) satisfies J"^^ = (1 — e)I + ej^^. 
Therefore, its eigenvalues {S^^^}^^^ are given by ^'^^ = 1 — e + e^^^. From the proof of Theorem |4l 
we know that < 1, Vn G A/". Therefore, if e < i.^in ^^^^ G (-1, 1), Vn G A/ and the 

Jacobi update dynamics converges. ■ 

Remarks.- Theorem |5] indicates that, for any {t„, /3„, A„}^^^ > 0, the Jacobi update mechanism 
globally converges to a CE as long as the stepsize is set to be a small enough positive number. In other 
words, the small stepsize in the Jacobi update can compensate for the instability of the best response 
dynamics even though the necessary and sufficient condition in ( [35l) is not satisfied. 



D. Stability of the Pareto Boundary 

In order to understand how to properly choose the parameters {Xn}n=i such that it leads to efficient 
outcomes, we need to explicitly describe the steady-state CE in terms of the parameters {A„}^^^ of the 
beUef functions. Denote the joint action profile at CE as (aj, . . . , a^). From Equation (1331) . we know 
that 

(A„ + /3nT„)a* + ^ f^nTma'^ = Pn^^^n G M. (39) 
m&J\f\{n} 

The solutions of the above linear equations are 

a^^ = ^, Vn G N. (40) 

Based on the closed-form expression of the CE, the following theorem indicates the stability of the Pareto 
boundary in Type II games. 

Theorem 6: For Type II games, all the operating points on the Pareto boundary are globally convergent 
CE under the best response dynamics. 

Proof: Comparing Equations ( [231 ) and ( |40l ). we can see that, (af^^, . . . ,a^^) = (af^, • • • if 
and only if A„ = Tn/uJn- Substitute it into the LHS of (l35l) : 

A„(l + 2/?„) l + 2/3n 2 2- ^ ^ 
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Condition (1351 ) is satisfied for all the Pareto-optimal operating points. In fact, we have min„ = 0, 
which is because q{0) = Yln=i Y' — Sn^=i = 1- Therefore, under the best response dynamics, the 
Pareto boundary is globally convergent. ■ 

In addition, we also note that Theorem [5] already indicates the stability of the Pareto boundary under 
Jacobi update as long as the parameters {r„,/3„, Xn}n=i are properly chosen. 

Remark 6: Since J2n=i "^n ~ ^' '^^^ from the previous proof that, the belief configurations 
{Xn}n=i lead to Pareto-optimal operating points if and only if 

N 

E = 1- (42) 

n=l 

Therefore, we can see that, to achieve Pareto-optimality in these non-cooperative scenarios, users need 
to choose the belief parameters {A„}^^^ to be greater than or equal to the parameters {t„}^^^ in the 
utility function {un}n=i and the summation of should be equal to 1. Define user n's conservativeness 
as Y', which reflects the ratio between the immediate performance degradation — r„Aa„ in the actual 
utility function and the long-term effect — A„Aa„ in the conjectured utility function if user n increases 
its action by Aa„. The condition in Equation (l42l ) indicates that, to achieve efficient outcomes, the 
non-collaborative users need to jointly maintain moderate conservativeness by considering the multi- 
user coupling and appropriately choosing {Xn}^=i- By "moderate", we mean that users are neither too 
aggressive, i.e. A„ Tn and Ylin=i IT ~^ conservative, i.e. A„ —^ +oo and Yln=i F" ^ 

If more than one user plays the Nash strategy and choose An = Tn, Equation (|42l) does not hold and the 
resulting operating point is not Pareto-optimal. Therefore, myopic selfish behavior is detrimental. 
Similarly as in ((24l) . we have 

^ , nn(a^^) ^ rn(l + Ef=i^./3,) 1 + Ef=i 

n=l "'^ ' n=l ■^n^n\^ ^ l^j=\ > ^ ^ l^j=l 

Using Jensen's inequaUty, we can conclude Y.n=i log ^"[^Pi;| < and YZ=i log ""uX^pb] = 
if and only if Wn = j^,Vn. Therefore, if a CE is Pareto efficient, user n's conservativeness Tn/\n 
corresponds to the weight assigned to user n in the weighted proportional fairness defined in (ITOl ). 

As an illustrative example, we simulate a three-user system with parameters /? = [1.5 1 0.5], r = 
[3 4 5], /i = 10, LOn = ^, Vn. In this case, the joint actions and the corresponding utilities at NE and Pareto 
boundary are summarized in Table I. The price of anarchy quantified according to ( [271 ) is —0.2877 and 
the lower bound in ( |27] ) is —0.5754. As discussed in Section III.C, both the upper bound and lower 
bound in (|27] ) are not tight. Fig. [T] shows the trajectory of the action updates under both best response 
and Jacobi update dynamics, in which a!^ = 0.5, An = ^,Vn, and e = 0.5. The best response update 
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converges to the Pareto-optimal operating point in around 8 iterations and the Jacobi update experiences 
a smoother trajectory and the same equilibrium is attained after more iterations. 

E. Discussions 

1) Comparison between Type I and Type II games: As mentioned before, the properties of Type I 
games have been investigated in the context of wireless random access [15]. Table II summarizes some 
similarities and differences between both types of games. First, the two algorithms exhibit different 
properties under the best response dynamics. In Type I games, the stable CE may not be globally 
convergent. However, the local stability of a CE implies its global convergence in Type II games. Second, 
it is shown in [15] that any operating point that is arbitrarily close to the Pareto boundary of the utility 
region of Type I games is a stable CE. Similarly, the entire Pareto boundary of Type II games is also stable. 
At last, different relationships between the parameter selection and the achieved utility at equilibrium 
have been observed for the two types of games. In particular, in Type I games, user n's utility u„ is 
approximately proportional to the inverse of the parameter A„ in its belief function. In contrast, in Type 
II games, if the CE is Pai^eto-optimal, the ratio x„/A„ coincide with the weight Un assigned to user n 
in the proportional fairness objective function. In other words, based on the definition of proportional 
fairness [26], we know 



in which {u[,U2, . . . , u'j^) is the users' achieved utility associated with any other feasible joint action and 
{ul,U2, . . . , u*^) is the optimal achieved utility for problem ( [TOt with tOn = Tn/Xn and ^2^=1 = 1- 

2 ) Pricing Mechanism vs. Conjectural Equilibrium: In order to achieve Pareto-optimality, information 
exchanges among users is generally required in order to collaboratively maximize the system efficiency. 
The existing cooperative communication scenarios either assume that the information about all the users 
is gathered by a trusted moderator (e.g. access point, base station, selected network leader etc.), to which 
it is given the authority to centrally divide the available resources among the participating users, or, in 
the distributed setting, users exchange price signals (e.g. the Lagrange multipliers for the dual problem) 
that reflect the "cost" for consuming per unit constrained resources to maximize the social welfare and 
reach Pareto-optimal allocations. As an important tool, the pricing mechanism has been applied in the 
distributed optimization of various communication networks [12]. However, we would like to point out 
that, the pricing mechanism generally requires repeated coordination information exchange among users 
in order to determine the optimal actions and achieve the Pareto-optimality. In contrast, for the linear 




(44) 
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coupled communication games, since the specific structure of the utility function is explored, the CE 
approach is able to calculate the Pareto efficient operating point in a distributed manner, without any real- 
time information exchange among users. In fact, the underlying coordination is implicitly implemented 
when the participating users initialize their belief parameters. Once the belief parameters are properly 
initialized by the protocol according to (l42l ). using the proposed dynamic update algorithms, individual 
users are able to achieve the Pareto-optimal CE solely based on their individual local observations on 
their states and no message exchange is needed during the convergence process. Therefore, the conjecture 
equilibrium approach is an important alternative to the pricing-based approach in the linearly coupled 
games. 

V. Conclusion 

We derive the structure of the utility functions in the multi-user communication scenarios where a user's 
action has proportionally the same impact over other users' utilities. The performance gap between NE 
and Pareto boundary of the utility region is explicitly characterized. To improve the performance in 
non-cooperative cases, we investigate a CE approach which endows users with simple linear beliefs 
which enables them to select an equiUbrium outcome that is efficient without the need of explicit 
message exchanges. The properties of the CE under both the best response and Jacobi dynamic update 
mechanisms are characterized. We show that the entire Pareto boundary in linearly coupled games is 
globally convergent CE which can be achieved by both studied dynamic algorithms without the need 
of real-time message passing. A potential future direction is to see how to extend the CE approach to 
the general linearly coupled games that are compositions of the basic two types and certain particular 
non-linearly coupled multi-user communication scenarios. 
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TABLE I 

Actions and payoffs at NE and Pareto boundary. 





User 1 


User 2 


User 3 




1.25 


0.625 


0.25 




3.4939 


1.5625 


1.25 




0.833 


0.417 


0.167 




3.8036 


2.0833 


2.0412 



TABLE II 

Comparison between Type I and Type II games. 



Games 


Best response dynamics 


Stability vs. efficiency 


Fairness vs. parameter selection 


Type I 


local stability <^ global convergence 


stable at near-Pareto-optimal points 


tin OC T„/Xn 


Type II 


local stability global convergence 


stable at the Pareto boundary 


iOn = rn/\n at the Pareto boundary 
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Abstract 

This paper discusses a special type of multi-user communication scenario, in which users' utilities 
are linearly impacted by their competitors' actions. First, we explicitly characterize the Nash equilibrium 
and Pareto boundary of the achievable utility region. Second, the price of anarchy incurred by the 
non-collaborative Nash strategy is quantified. Third, to improve the performance in the non-cooperative 
scenarios, we investigate the properties of an alternative solution concept named conjectural equiUbrium, 
in which individual users compensate for their lack of information by forming internal beliefs about 
their competitors. The global convergence of the best response and Jacobi update dynamics that achieve 
various conjectural equilibria are analyzed. It is shown that the Pareto boundaries of the investigated 
linearly coupled games can be sustained as stable conjectural equilibria if the belief functions are properly 
initialized. The investigated models apply to a variety of realistic applications encountered in the multiple 
access design, including wireless random access and flow control. 

Index Terms 

Nash equilibrium, Pareto-optimaUty, conjectural equilibrium, non-cooperative games. 

I. Introduction 

Game theory provides a formal framework for studying the interactions of strategic agents. Recently, 
there has been a surge in research activities that employ game theory to model and analyze a wide range 
of application scenarios in modern communication networks [I]- [4]. In communication networks, any 
action taken by a single user usually affects the utilities of the other users sharing the same resources. 
Depending on the characteristics of different applications, numerous game-theoretical models and solution 
concepts have been proposed to describe the multi-user interactions and optimize the users' decisions in 
communication networks. Roughly speaking, the existing multi-user research can be categorized into two 
types, non-cooperative games and cooperative games. Various game theoretic solutions were developed 
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to characterize the resulting performance of the multi-user interaction, including the Nash Equihbrium 
(NE) and the Pareto-optimality [18]. 

Non-cooperative approaches generally assume that the participating users simply choose actions to 
selfishly maximize their individual utility functions. It is well-known that if devices operate in a non- 
cooperative manner, this will generally limit their performance as well as that of the whole system, because 
the available resources are not always efficiently exploited due to the conflicts of interest occurring among 
users [5]. Most non-cooperative approaches are devoted to investigating the existence and properties of 
the NE. In particular, several non-cooperative game models, such as S-modular games, congestion games, 
and potential games, have been extensively apphed in various communication scenarios [6]- [9]. The price 
of anarchy, a measure of how good the system performance is when users play selfishly and reach the 
NE instead of playing to achieve the social optimum, has also been addressed in several cormnunication 
network applications [10] [11]. 

On the other hand, cooperative approaches in communication theory usually focus on studying how 
users can jointly improve their performance when they cooperate. For example, the users may optimize a 
common objective function, which represents the Pareto-optimal social welfare allocation rule based on 
which the system- wide resource allocation is performed [12] [13]. A profile of actions is Pareto-optimal 
if there is no other profile of actions that makes every player at least as well off and at least one player 
strictly better off. Allocation rules, e.g. network utility maximization, can provide reasonable allocation 
outcomes by considering the trade-off between fairness and efficiency. Most cooperative approaches focus 
on studying how to efficiently find the optimum joint policy. It is worth mentioning that information 
exchanges among users is generally required to enable users to coordinate in order to achieve and sustain 
Pareto-efficient outcomes. 

In this paper, we present a game model for a particular type of non-cooperative multi-user communi- 
cation scenario. We name it linearly coupled communication games, because users' utihties are linearly 
impacted by their competitors' actions. In particular, the main contributions of this paper are as follows. 
First, based on the assumptions that we make about the properties of users' utihty, we characterize 
the inherent structures of the utility functions for the hnearly coupled games. Furthermore, based on 
the derived utihty forms, we explicitly quantify the NE and Pareto boundary for the hnearly coupled 
cormnunication games. The price of anarchy incurred by the selfish users playing the Nash strategy is 
quantified. In addition, to improve the performance in the non-cooperative scenarios, we investigate an 
alternative solution: conjectural equilibrium (CE). Using this approach, individual users are modeled as 
behef-forming agents that develop internal behefs about their competitors and behave optimally with 
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respect to their individual beliefs. Necessary and sufficient conditions that guarantee the convergence of 
different dynamic update mechanisms, including the best response and Jacobi update, are addressed. We 
prove that these adjustment processes based on conjectures and non-cooperative individual optimization 
can be globally driven to Pareto-optimality in the linearly coupled games without the need of real-time 
coordination information exchange among agents. 

The rest of this paper is organized as follows. Section II defines the linearly coupled communication 
games. For the investigated game models. Section m explicitly computes the NE and Pareto boundary 
of the achievable utility region and quantifies the price of anarchy. Section TV introduces the CE and 
investigates its properties under both the best response and Jacobi update dynamics. Conclusions are 
drawn in Section V. 

II. Game Model 

In this section, we first provide a general game-theoretic formulation of the multi-user interaction in 
communication systems. Following the proposed definition, we define the linearly coupled communication 
games and provide concrete examples of the investigated game model. 

A. Linearly Coupled Communication Games 

The multi-user game in various communication scenarios can be formally defined as a tuple F = 
{J\f,A,u,S,s). In particular, Af = {1,2, .. . ,N} is the set of communication devices, which are the 
rational decision-makers in the system. Define A to be the joint action space .4 = x neAfAn, with An 
being the action set available for user n. As opposed to the traditional strategic game definition [18], 
we introduce two new elements S and s into the game formulation. Specifically, S is the state space 
•S = ^neAf^n, where Sn ^ is the part of the state relevant to user n. The state is defined to 
capture the effects of the multi-user coupling such that each user's utility solely depends on its own state 
and action. In other words, the utihty function u = XneAfUn is a mapping from the individual users' 
state space and action space to real numbers, Un '■ Sn x An Tl- The state determination function 
s = XneAfSn maps joint actions to states for each component s„ : .A ^ >S„. To capture the performance 
tradeoff, the utility region is defined asU = {(ni(a), . . . , 'UAr(a))| 3 a = (ai, 02, . . . , ajv) £ A}. 

Definition 1: A multi-user interaction is considered a linearly coupled communication game if the 
action set An C 7?.+ is convex and the utility function u„ satisfies: 

Un(a) = a^" • s„(a), (1) 



4 



in which /3„ > 0. In particular, the basic assumptions about Sr!,(a) include: 
Al: Sn(a) is non-negative; 

A2: Denote s'^^{a.) = ^^^^ and s'^mi^) = Sn{sL) is strictly linear decreasing in amj^rn ^ n, 

i.e. sJim(a) < and s^„(a) = 0; s„(a) is non-increasing and linear in a„, i.e. s'^„i^) — ^'^'i ^nn(^) = 0- 
A3: ^f"^^^^ is an affine function, Vn G \ {m}. 

A4: <^ = <^,Vn,A: eA/'\{m}; <^ = or Vn / m. 

Assumptions Al and A2 indicate that increasing for any m ^ n within the domain of s„(a) will 
linearly decrease user n's utility. Assumptions A3 and A4 imply that a user's action has proportionally 
the same impact over the other users' utility. The structure of the utility functions that satisfy assumptions 
A1-A4 will be addressed in Section III. 



B. Illustrative Examples 

There are a number of multi-user communication scenarios that can be modeled as linearly coupled 
communication games. For example, in the random access scenario [15], the action of a node is to 
select its transmission probability and a node n will independently attempt transmission of a packet with 
transmit probability p„. The action set available to node n is An = [0, 1] for all n E N. In this case, the 
utility function is defined as 

As an additional example, in flow control [16], N Poisson streams of packets are serviced by a single 
exponential server with departure rate /i and each class can adjust its throughput r„. The utility function 
is defined as the weighted ratio of the throughput over the average experienced delay: 

N 

Un{r) = r^" ■ {fi- ^rm), (3) 

m=l 

in which /?„ > is interpreted as the weighting factor. Specifically, we can see that the state determination 
functions are s„(p) = l\m€Af\{n}i'^ - Pm) in © and s„(r) = fi- J2Z=i '^m in ©. It is straightforward 
to verify that these functions satisfy assumptions A1-A4 for both ^ and 

In this paper, we are interested in comparing the achievable performance attained by different game- 
theoretic solution concepts. On one hand, it is well-known that NE is generally inefficient in com- 
munication games [17], but it may not require explicit message exchanges, while Pareto-optimality can 
usually be achieved only by exchanging implicit or explicit coordination messages among the participating 
users. On the other hand, in several recent works [14] [15], we have applied an alternative solution 
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in different communication scenarios to improve the system performance in non-cooperative settings, 
namely the conjectural equilibrium [21]. The following sections aim to compare the solutions of NE, 
Pareto boundary, and CE in terms of the payoffs and informational requirements in the linearly coupled 
multi-user interaction satisfying the assumptions A1-A4. 

III. Computation of the Nash Equilibrium and Pareto Boundary for Linearly Coupled 



In this section, we show that the computation of the NE and the Pareto boundary in linearly coupled 
games is equivalent to solving linear equations. Specifically, we investigate the inherent structures of the 
utility functions satisfying assumptions A1-A4 and define two basic types of linearly coupled games. The 
performance loss incurred by the Nash strategy are quantified for Type II games. 

A. Nash Equilibrium 

In non-cooperative games, the participating users simply choose actions to selfishly maximize their 
individual utility functions. The steady state outcome of such interactions is an operating point, at which 
given the other users' actions, no user can increase its utility alone by unilaterally changing its action. 
This operating point is known as the Nash equilibrium, which is formally defined below [18]. 

Definition 2: A profile a of actions constitutes a Nash equilibrium of T if Un{an, a_„) > ii„(a[j, a_„) 
for all a'„ E An and n € J\f. 

We are interested in computing the NE in the linear coupled games. From equation ([T]), we have 



On one hand, if s[i„(a) = 0, Vn € J\f, since user n's utility function strictly increases in a„, we have 
trivial NE at which a* is the maximal element in An that lies in the domain of s(-), Vn € J\f. 

On the other hand, if s^„(a) / 0,Vn G J\f, according to assumption A3, since the multi-user 
interactions are linearly coupled, we have 



Games 




(4) 



Sn(a) = C(a_^)+5r(a-™K 



(5) 



where /™(a_m), ^^^(a-m) both polynomials and (7"(a_„) / 0. From this, it follows 



(6) 
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At NE, we have 

51og[ti„(a)] 



da. 



0,VnGA/'. (7) 



n 



Under assumption A3 and A4, is a affine function, which enables us to expUcitly characterize 

the NE. Denote = /i„(a_„). Equation d?]) can be rewritten as 

) + (/3„ + l)-a„ = 0,VnGA/'. (8) 

Therefore, the solutions of Equations ([U are the NE of the linearly coupled games and computing the 
NE is equivalent to solving A^-dimension linear equations. The following theorem indicates the inherent 
structure of the utility functions {un}n=i when the requirements A1-A3 are satisfied. 

Theorem 1: Under assumptions A1-A3, the irreducible factors of s„(a) over the integers are affine 
functions and have no variables in common. 

Proof: Denote the factorization of s„(a) as 



s„(a) = JJ^Ua), (9) 



i=l 

in which M„ represents the number of the non-constant irreducible factors in s„(a). Define V(-) as the 
mapping from a polynomial to the set of variables that appear in that polynomial. Based on assumption 
A2, we immediately have 

V(6i(a)) n V(6{(a)) = 0,Vi,j(i / i),n. 

Without loss of generality, we assume that aj € V(&,^(a)) and bl^{a.) = f^i (a-j) +5^1 (a_j)aj. Then 
fi{a^j),gi{a^j) in © are given by 

M,. M„ 

/n(a-i) = fbii^-j) • n^"(^)' ^"'^ 5n(a-i) = 5bja_.j) • Jj6;,(a). 

1=2 i=2 

Therefore, /'^~"'\ = — 4- By assumption A3, we have that the degree of ^!-^, — 4 is less than or 
equal to 1. Since ^^'^(a) is irreducible, we can conclude that g^^ (a_j) is a constant and the degree of 
fli (a_j) is less than or equal to 1. Note that the arguments above hold, Vj, n. Therefore, the degree of 
6^ (a) is one, Vn € J\f, i = 1, • • • , M„, which concludes the proof. ■ 



B. Pareto Boundary 

Since log(-) is concave and log[u„(a)] is a composition of affine functions [19], nn(a) is log-concave 
in a and the log-utility region \ogU is convex. Therefore, we can characterize the Pareto boundary of 
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the utiUty region as a set of a optimizing the following weighted proportional fairness objectiveLl: 

N 

maxV] a;„log[n„(a)], (10) 



a 

n=l 



for all possible sets of {ujn} satisfying w„ > and Yln=i — 1- Denote the optimal solution of problem 
([Tol l as a^^, which satisfies the following first-order condition: 



0,\/neAf, (11) 



dan 

Under assumptions A1-A3, the LHS of equation ([TT]) can be rewritten as 

= \ + } ^ i^k (12) 



By Theorem [T] and assumption A4, we have 

^ VA:GAA\{m}, (13) 



4m (a) _ 1 



in which V'm(a) is a affine function. Therefore, equation (fT2l ) is equivalent to 



^^^=1^^^ log[nfc(a)] 



Pmi-^m/am + (1 " a;m)/V'm(a), i/ Smm(a) = 0; 
/JmWm/am, + l/^/;m(a), otherwise. 



(14) 



We can compute the Pareto boundary of the linearly coupled games by solving linear equations: 

C>Y.k=l^k logK(a)] _ J /3mCJ„Vm(a) + (1 - UJm)am = 0, if S'mmi^) = 0; 

[ /3ma;mV'm(a) + am = 0, otherwise. 

Theorem [U reveals the structural properties of the utility functions {un}n=i when assumption A1-A3 
are satisfied. Based on Theorem [T] the following theorem further refines these properties of {un}n=i 
when the additional assumption A4 is imposed. 

Theorem 2: Under assumptions A1-A4, for any polynomial 6^ (a) in the factorization s„(a) = nf=i ^n(a)' 
Vn G M, if |V(6^(a))| > 2 or V(&*„(a)) = {a„}, 6*„(a) is an irreducible factor of Sm(a), Vm e AA; if 
V(6^(a)) = {am},rn 7^ n, h\^{a) is an irreducible factor of Sj(a), Vj G AA/{m}. 

Proof: By assumption A2, s^^(a) < 0,Vm, 7^ n, we have |V(s„(a))| > N — l,\/n G N. By 
Theorem [H the irreducible factors of s„(a) have no common variables and they are affine functions. 
Suppose |V(6^(a))| > 2 and {am,ai} G V(6*„(a). By assumption A4, we know that ^fj^ = ^f^^ = 
V"(a)^ , Vn, k ^ J\f\ {m}. Therefore, it follows 

'Note that the utility region U is not necessarily convex. Therefore, its Pareto boundary may not be characterized by the 
weighted sum of {u„(a)}^^i. 
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Since 6^„(a) is a constant, we can see that 6^ (a) is an irreducible factor of Sfc(a), Vfc € \ {m}. 
By symmetry, we can conclude that 6^(a) must also be an irreducible factor of Sfe(a), \/k J\f\ {I}. 
Therefore, 6^ (a) is an irreducible factor of Sfc(a), Vfc G Af. Similarly, we can prove the remaining parts 
of Theorem |2l ■ 

Remark 1: For the linearly coupled games satisfying assumptions A1-A4, suppose we factorize all 
users' state functions. Theorem |2] indicates that any factor with at least two variables must be a common 
factor of all the users' state functions, and any factor with a single variable must be a common factor 
of state functions for users excluding k. In reality, it corresponds to the communication scenarios in 
which the state, i.e. the multi-user coupling, is impacted by a set of users that result in a similar signal 
to all the users. 

We define two basic types of linearly coupled games satisfying the assumptions A1-A4. In Type I 
games, user A;'s action linearly decreases all the users' states but itself. Hence, the utility functions take 
the form 

Un{a) = a^" ■ Y[ ifJ-m - Tmam)- (17) 

In Type II games, all the users share the same non-factorizable state function and their utility functions 
are given by 

N 

Un{a) = a^" • (/X - ^ r„,am)- (18) 

m=l 

As special examples, the random access problem in ^ belongs to Type I games and the rate control 
problem in ([3]l belongs to Type II games. In fact, all the games that have the properties A1-A4 can be 
viewed as compositions of these two basic types of games (See the example in Remark [T]). Therefore, 
investigating the two basic types provides us the fundamental understanding of the linearly coupled multi- 
user interaction. A brief summary of the properties of Type I games will be provided in Section IV.E. 
For the details about its various game-theoretic solutions, we refer the readers to [15] and the references 
therein. The rest of this paper will focus on Type II games. 

C. Nash Equilibrium and Pareto Boundary in Type II Games 
For Type II games with utility functions given in ([TSl l. we have 

^nn (^) _ ~'^n 

Therefore, Equation ^ can be reduced to 

(1 + /3n)T„a„ + /?n ^ Tjnam = /^nM, Vn G M 



(19) 
(20) 
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The solution of the linear equations gives the NE, and its closed form has been addressed in [22] for 
Tn = l,Vn G AA. For the general case, it is easy to verify that the NE is given by 



r = — ,VneAA. (21) 



Similarly, to compute the Pareto boundary of Type II games. Equation ([T4l) can be reduced to 

(1 + a;„/?„)T„a„ + UnPn ^ T^Om = w„/3„/U, Vn G J\f. (22) 

The solution is given by 

= n —,\/neM. (23) 

From Section II.B, we know that the region logU is convex. Therefore, we can compare the efficiency 
of and a^^ using the system-utility metric X]^=i '^n ^°s[^n(^)]- Specifically, we have 

n=l """^^ ^ n=l Wn(l + Ei=l/?j) 1 + Ei=l /^j 

Denote t^o = 1, xq = i , v^Tv' ;L \ it'n = and x„ = — 7T-f;V-^, Vn e AA. Therefore, 



n=l n=l n=0 n=0 

Using the inequalities among the arithmetic, geometric and harmonic means [24], we have 



(1 + En=l ^^/?n)(l + E:=1 /3n) E„=0 " n=0 " E 



(25) 



(26) 



n=0 



Both inequalities hold with equality if and only if xq = xi = . . . = xat, i.e. = . . . = wat = 1. 
However, since we require E^=i = 1, (l26l ) holds as strict inequalities, which leads to 

n=l (l + En=l'^n/3n)(l + En=l/5n) „=i ^"1^ j 

Based on Equation (|27] ). we can make two important observations. First, due to the lack of coordination, 
the NE in Type II games is always strictly Pareto inefficient. Second, as opposed to Type I games where 
NE may result in zero utility for certain users [15], the efficiency loss in Type II games are lower bounded, 
which means that every user receives positive payoff at NE. Noticing that the performance gap between 
ti„(a^^) and nn(a^^) is non-zero, we will investigate how the non-cooperative CE solution can improve 
the system performance for Type II games. 
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IV. Conjectural Equilibrium for the Linearly Coupled Games 

A. Definitions 

In game-theoretic analysis, conclusions about the reached equiUbria are based on assumptions about 
what knowledge the players possess. For example, the standard NE strategy assumes that every player 
believes that the other players' actions will not change at NE. Therefore, it chooses to myopically 
maximize its immediate payoff [18]. Therefore, the players operating at equiUbrium can be viewed as 
decision makers behaving optimally with respect to their beliefs about the strategies of other players. 

To avoid detrimental Nash strategy and encourage cooperation, the conjecture-based model has been 
introduced by Wellman and others [20] [21] to enable non-cooperative players to build belief models 
about how their competitors' reactions vary in response to their own action changes. Specifically, each 
player has some belief about the state that would result from performing its available actions. The belief 
function Sn is defined to be s„ : An —>■ >5„ such that s„(a„) represents the state that player n beheves it 
would result in if it selects action a„ . Notice that the beliefs are not expressed in terms of other players' 
actions and preferences, and the multi-user coupling in these beliefs is captured indirectly by individual 
players forming conjectures of the effects of their own actions. By deploying such a behavior model, 
players will no longer adopt myopic behaviors that do not forecast s„, but rather they will form behefs 
Snicin) about how their actions a„ will influence the aggregate effects s„ incurred by their competitors' 
responses and, based on these beliefs, they will choose the action a„ G An if it believes that this action 
will maximize its utility. The steady state of such a play among belief-forming agents can be characterized 
as a conjectural equilibria. 

Definition 3: In the game T, a configuration of behef functions (s^, . . . , s^) and a joint action a* = 
(o*, . . . , a^) constitute a conjectural equilibrium, if for each n E Af, 

Sn(«n) = Sn{al, • • • , Oat) and a* = arg max (a„), a„). 

an&An 

From the above definition, we can see that, at CE, all players' expectations based on their beliefs are 
reaUzed and each agent behaves optimally according to its expectation. In other words, agents' behefs 
are consistent with the outcome of the play and they use "conjectured best responses" in their individual 
optimization program. The key challenges are how to configure the belief functions such that cooperation 
can be sustained in such a non-cooperative setting and how to design the evolution rules such that the 
communication system can dynamically converge to a CE having satisfactory performance. 
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B. Linear Beliefs 

As discussed before, the belief functions need to be defined in order to investigate the existence of CE. 
To define the belief functions, we need to express agent n's expected state s„ as a function of its own 
action a„. The simplest approach is to design linear belief models for each user, i.e. player n's belief 
function takes the form 

Sn(an) = Sn - A„(a„ - a„), (28) 

for n € N. The values of Sn and a„ are specific states and actions, called reference points and A„ is 
a positive scalar. In other words, user n assumes that other players will observe its deviation from its 
reference point a„ and the aggregate state deviates from the reference point s„ by a quantity proportional 
to the deviation of a„ — a„. How to configure s„, a„, and A„ will be addressed in the rest of this paper. 
We focus on the linear belief represented in ( [28] ). because this simple belief form is sufficient to drive 
the resulting non-cooperative equilibrium to the Pareto boundary. 

The goal of user n is to maximize its expected utility On" • Sn{an) taking into account the conjectures 
that it has made about the other users. Therefore, the optimization a user needs to solve becomes: 



max 



Ara(ffln ~ On) 



(29) 



For Afe > 0, user n believes that increasing an will further reduce its conjectured state Sn- The optimal 
solution of (|29l ) is given by 

A„(l + /3„) • ^■'"^ 



In the following, we first show that forming simple linear beliefs in (1281) can cause all the operating 
points in the achievable utility region to be CE. 

Theorem 3: For Type II games, all the positive operating points in the utility region U are essentially 
CE. 

Proof: For each positive operating point (nj, . . . (i.e. n* > 0, Vn € J\f) in the utility region U, 
there exists at least one joint action profile (a*, . . . ,a^) € A such that u* = u„(a*), Vn € M. We 
consider setting the parameters in the belief functions {sn{an)}n=i to be: 

A; = /3„. ^ ^"^=1 VngAA. (31) 



It is easy to check that, if the reference points are s„ = /u — J2m=i '^mO-m^ = ^n' have Snia^) = 
Sn{al, . . . , a^) and a* = arg raaXa^eAr, Un{sn{an),an)- Therefore, this belief function configuration and 
the joint action a* = (aj, . . . , a*^) constitute the CE that results in the utility {u\, . . . , u^). ■ 
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Theorem [3] establishes the existence of CE, i.e. for a particular a* € A, how to choose the parameters 
{sn,an, \n}n=i ^uch that a* is a CE. However, it neither tells us how these CE can be achieved and 
sustained in the dynamic setting nor clarifies how different belief configurations can lead to various CE. 

We consider the dynamic scenarios in which users revise their reference points based on their past 
local observations over time. Let s^, a^, sf^, s^, be user n's state, action, belief function, and reference 
points at stage t, in which = — J2m=i '^mO-m- propose a simple rule for individual users to 
update their reference points. At stage t, user n sets its and to be and a^~^. In other words, 
user n's conjectured utility function at stage t is 



N 

' ^ ] (32) 

m=l 

Since we have defined the users' utility function at stage t, upon specifying the rule of how user n updates 
its action based on its utility function ufj(s^(a„), a„), the trajectory of the entire dynamic process 
is determined. The remainder of this paper will investigate the dynamic properties of the best response 
and Jacobi update mechanisms and the performance trade-off among the competing users at the resulting 
steady-state CE. In particular, for fixed {A„}^^^, Section IV-C derives necessary and sufficient conditions 
for the convergence of the best response and the Jacobi update dynamics. Section IV-D quantitatively 
describes the limiting CE for given {A^}^^^ and investigates how the parameters {A„}^^^ should be 
properly chosen such that Pareto efficiency can be achieved. 

C. Dynamic Algorithms 

1 ) Best Response: In the best response algorithm, each user updates its action using the best response 
that maximizes its conjectured utility function in ( [32l ). Therefore, at stage t, user n chooses its action 
according to 



We are interested in characterizing the convergence of the update mechanism defined by (1331) when using 
various An to initialize the belief function s„. 

To analyze the convergence of the best response dynamics, we consider the Jacobian matrix of the 
self-mapping function in (l33l) . Let Jj^ denote the element at row i and column k of the Jacobian matrix 
J. The elements of the Jacobian matrix J^^ of (l33l) are defined as: 

RR da' f ¥utPy^ if i = k, 
^ I A,(l+/3.)' " ^ ^ 
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For Type II games, the following theorem gives a necessary and sufficient condition under which the 
best response dynamics defined in (1331 ) converges. 

Theorem 4: For Type II games, a necessary and sufficient condition for the best response dynamics 
to converge is 



N 

E 



< 1. 



(35) 



^^A„(l + 2/3„) 

Proof: The best response dynamics converges if and only if the eigenvalues of the Jacobian 

matrix J^^ in (l34l ) are all inside the unit circle of the complex plane [25], i.e. \S,n^'\ < 1, Vn € M. To 
determine the eigenvalues of J^^, we have 



det(e/-J 



BR\ 



_ /3i(Ai-ri) 
Ai(l+/3i) 



/3jvTl 

An(1+/3n) 

_ /3i(Ai-Ti) 
Ai(l+/3i) 

/32T1 



A2(l+/32) 
/3]VTl 

Ajv(l+/3«) 
/3 



Ai(l+/3i) 

_ A(A2-T2) 
A2(l+/32) 



/?WT2 

Ajv(l+/3jv) 

/32 



22. 

Tl V 1+/3; 



1+/32 



N 



Ai(l+/3i) 

A2(l+/32) 
/3w(AjV— Tjv) 

A„(l+/3„) 



Tl U+/3; 



i+/3i) ■ Sn=i wiri+s 



0- 







/3iv 
l+/3iv 



1+/32 



A«(l+/3jv) 

Therefore, we can see that, the eigenvalues of J^^ are the roots of 



N 



N 



n=l ^ri\^ /3„ 



1 



fin 



n=l 



l + /3n 



)=0. 



(36) 



Denote q{d = Yl,n=i ITiT^f+aZTT- First, we assume that (3i / j- Without loss of generality, con- 

sider (3i < {32 < ■ ■ ■ < Pn- In this case, the eigenvalues of J^^ are the roots of q{S^) = 1. Note that q{^) 
is a continuous function and it strictly increases in (— cxd, j^^), (t+^; T+/3^)' ' ' ' ' ( i+fe^ i ' T+f^)' ^^'^ 



( -if^ , +oo). We also have lim^ ^ g(^) = +cx), liiiif / 3„ (?(^) = — oo, n = 1, 2, • • • , A^, and 

limg__oog(0 = Iim5^+oo9(0 = 0. Therefore, the roots of q{0 = 1 lie in (-00, j^), {j^, j^), 
■ • • . (t+I^' T+f^)- '^i"'^^ l(0 strictly increases in (-00, jf^), we have l^^^^l < 1, Vn G if and 



-/3jv-i' 1+/3 

only if q{—l 



A„(l+2/3„) 



< 1. 
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Second, we consider the cases in which there exists /3j = (3j for certain i, j. Suppose that {Pn}n=i t^^e 
K discrete values /ti, • • • ,kk and the number of {j3n}n=i that equal to Kk is Uk- In this case, Equation 
( l36b is reduced to 

TV K 

\y ll •TT(e-T^)"'=0- (37) 

Hence, equation g(^) = 1 has N + K — J2k=i "-fc roots in total, and ^ = is a root of multiplicity 
rifc — 1 for Equation ( [371 ). V/c. All these roots are the eigenvalues of matrix J^^. Similarly, the roots of 
q{0 = 1 lie in (-oo, (^^^, A necessary and sufficient condition 

under which \^^^\ < 1, Vn G A/" is still q{-l) < 1, i.e. J2n=i xji+w^) < 1" " 

Remark 2: Theorem |4] indicates that, if the condition in (1351 ) is satisfied, the best response dynamics 
converges linearly to the CE. The convergence rate is mainly determined by max„g^ |^^^|. Suppose 
Pi < l32 < ■ ■ ■ < Pn and ^ < ^ < • • • < From the proof of Theorem IH we can see 

that, under condition m, -1 < ^ < ll^ < ^2'''' < • ' ' < ^n""' and < < 

Therefore, the rate of convergence can be approximated by max{[^^^|, [^^^|}. Note that choosing 
larger {A„}^^^ increases ^i^. Hence, if —1 < < — l^^^^l, increasing {A„}^^^, i.e. having more 
self-constraint users, accelerate the convergence rate of the best response mechanism. On the other hand, 
since > , ^^'^ , the convergence rate is lower bounded by , ^^'^ ■ Therefore, if more than two 
users associate large weighting factors /? with their individual actions in the utility functions, we have 
-1^;^^"^ ^ — i> 1 and the best response dynamics converges slowly. 

Remark 3: Theorem |4] generalizes the necessary and sufficient condition derived in [22], where users 
are assumed to be symmetric, i.e. r„ = 1, Vn and they adopt the Nash strategy by choosing A„ = t„, Vn. 
Due to lack of symmetry, the derivation in [22] is not readily applicable to analyze the convergence of 
the best response dynamics. The proof of Theorem |4l instead directly characterizes the eigenvalues of 
the Jacobian matrix, and hence, provides a more general convergence analysis of the dynamic algorithms 
that allow users to update their actions based on their independent linear conjectures. 

Remark 4: In Type II games, a locally stable CE is also globally convergent, which is purely due to 
the property of its utility functions specified in ( [TSl l. From ([34] ). we can see that all the elements in J^^ 
are independent of the joint play a*~^. This is in contrast with Type I games considered in [15], where 
local stabihty of a CE may not imply its global convergence and the best response dynamics may only 
converge if the operating point is close enough to the steady-state equilibrium. 

2) Jacobi Update: We consider another alternative strategy update mechanism called Jacobi update 
[23]. In Jacobi update, every user adjusts its action gradually towards the best response strategy. At stage 
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t, user n chooses its action according to 

ai = J„(a*-i) := aJr' + e[5n(a*-^) - a''^], (38) 

in which the stepsize e > and i?„(a*^^) is defined in ( [33l ). The following theorem establishes the 
convergence property of the Jacobi update dynamics. 

Theorem 5: In Type II games, for given {r„, /3„, \n}^^i, the Jacobi update dynamics converges if the 
stepsize e is sufficiently small. 

Proof: The Jacobian matrix i"^^ of the self-mapping function (|38] ) satisfies J"^^ = (1 — e)I + ej^^. 
Therefore, its eigenvalues {S^^^}^^^ are given by ^'^^ = 1 — e + e^^^. From the proof of Theorem |4l 
we know that < 1, Vn G A/". Therefore, if e < i.^in ^^^^ G (-1, 1), Vn G A/ and the 

Jacobi update dynamics converges. ■ 

Remarks.- Theorem |5] indicates that, for any {t„, /3„, A„}^^^ > 0, the Jacobi update mechanism 
globally converges to a CE as long as the stepsize is set to be a small enough positive number. In other 
words, the small stepsize in the Jacobi update can compensate for the instability of the best response 
dynamics even though the necessary and sufficient condition in ( [35l) is not satisfied. 



D. Stability of the Pareto Boundary 

In order to understand how to properly choose the parameters {Xn}n=i such that it leads to efficient 
outcomes, we need to explicitly describe the steady-state CE in terms of the parameters {A„}^^^ of the 
beUef functions. Denote the joint action profile at CE as (aj, . . . , a^). From Equation (1331) . we know 
that 

(A„ + /3nT„)a* + ^ f^nTma'^ = Pn^^^n G M. (39) 
m&J\f\{n} 

The solutions of the above linear equations are 

a^^ = ^, Vn G N. (40) 

Based on the closed-form expression of the CE, the following theorem indicates the stability of the Pareto 
boundary in Type II games. 

Theorem 6: For Type II games, all the operating points on the Pareto boundary are globally convergent 
CE under the best response dynamics. 

Proof: Comparing Equations ( [231 ) and ( |40l ). we can see that, (af^^, . . . ,a^^) = (af^, • • • if 
and only if A„ = Tn/uJn- Substitute it into the LHS of (l35l) : 

A„(l + 2/?„) l + 2/3n 2 2- ^ ^ 
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Condition (1351 ) is satisfied for all the Pareto-optimal operating points. In fact, we have min„ = 0, 
which is because q{0) = Yln=i Y' — Sn^=i = 1- Therefore, under the best response dynamics, the 
Pareto boundary is globally convergent. ■ 

In addition, we also note that Theorem [5] already indicates the stability of the Pareto boundary under 
Jacobi update as long as the parameters {r„,/3„, Xn}n=i are properly chosen. 

Remark 6: Since J2n=i "^n ~ ^' '^^^ from the previous proof that, the belief configurations 
{Xn}n=i lead to Pareto-optimal operating points if and only if 

N 

E = 1- (42) 

n=l 

Therefore, we can see that, to achieve Pareto-optimality in these non-cooperative scenarios, users need 
to choose the belief parameters {A„}^^^ to be greater than or equal to the parameters {t„}^^^ in the 
utility function {un}n=i and the summation of should be equal to 1. Define user n's conservativeness 
as Y', which reflects the ratio between the immediate performance degradation — r„Aa„ in the actual 
utility function and the long-term effect — A„Aa„ in the conjectured utility function if user n increases 
its action by Aa„. The condition in Equation (l42l ) indicates that, to achieve efficient outcomes, the 
non-collaborative users need to jointly maintain moderate conservativeness by considering the multi- 
user coupling and appropriately choosing {Xn}^=i- By "moderate", we mean that users are neither too 
aggressive, i.e. A„ Tn and Ylin=i IT ~^ conservative, i.e. A„ —^ +oo and Yln=i F" ^ 

If more than one user plays the Nash strategy and choose An = Tn, Equation (|42l) does not hold and the 
resulting operating point is not Pareto-optimal. Therefore, myopic selfish behavior is detrimental. 
Similarly as in ((24l) . we have 

^ , nn(a^^) ^ rn(l + Ef=i^./3,) 1 + Ef=i 

n=l "'^ ' n=l ■^n^n\^ ^ l^j=\ > ^ ^ l^j=l 

Using Jensen's inequaUty, we can conclude Y.n=i log ^"[^Pi;| < and YZ=i log ""uX^pb] = 
if and only if Wn = j^,Vn. Therefore, if a CE is Pareto efficient, user n's conservativeness Tn/\n 
corresponds to the weight assigned to user n in the weighted proportional fairness defined in (ITOl ). 

As an illustrative example, we simulate a three-user system with parameters /? = [1.5 1 0.5], r = 
[3 4 5], /i = 10, LOn = ^, Vn. In this case, the joint actions and the corresponding utilities at NE and Pareto 
boundary are summarized in Table I. The price of anarchy quantified according to ( [271 ) is —0.2877 and 
the lower bound in ( |27] ) is —0.5754. As discussed in Section III.C, both the upper bound and lower 
bound in (|27] ) are not tight. Fig. [T] shows the trajectory of the action updates under both best response 
and Jacobi update dynamics, in which a!^ = 0.5, An = ^,Vn, and e = 0.5. The best response update 
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converges to the Pareto-optimal operating point in around 8 iterations and the Jacobi update experiences 
a smoother trajectory and the same equilibrium is attained after more iterations. 

E. Discussions 

1) Comparison between Type I and Type II games: As mentioned before, the properties of Type I 
games have been investigated in the context of wireless random access [15]. Table II summarizes some 
similarities and differences between both types of games. First, the two algorithms exhibit different 
properties under the best response dynamics. In Type I games, the stable CE may not be globally 
convergent. However, the local stability of a CE implies its global convergence in Type II games. Second, 
it is shown in [15] that any operating point that is arbitrarily close to the Pareto boundary of the utility 
region of Type I games is a stable CE. Similarly, the entire Pareto boundary of Type II games is also stable. 
At last, different relationships between the parameter selection and the achieved utility at equilibrium 
have been observed for the two types of games. In particular, in Type I games, user n's utility u„ is 
approximately proportional to the inverse of the parameter A„ in its belief function. In contrast, in Type 
II games, if the CE is Pai^eto-optimal, the ratio x„/A„ coincide with the weight Un assigned to user n 
in the proportional fairness objective function. In other words, based on the definition of proportional 
fairness [26], we know 



in which {u[,U2, . . . , u'j^) is the users' achieved utility associated with any other feasible joint action and 
{ul,U2, . . . , u*^) is the optimal achieved utility for problem ( [TOt with tOn = Tn/Xn and ^2^=1 = 1- 

2 ) Pricing Mechanism vs. Conjectural Equilibrium: In order to achieve Pareto-optimality, information 
exchanges among users is generally required in order to collaboratively maximize the system efficiency. 
The existing cooperative communication scenarios either assume that the information about all the users 
is gathered by a trusted moderator (e.g. access point, base station, selected network leader etc.), to which 
it is given the authority to centrally divide the available resources among the participating users, or, in 
the distributed setting, users exchange price signals (e.g. the Lagrange multipliers for the dual problem) 
that reflect the "cost" for consuming per unit constrained resources to maximize the social welfare and 
reach Pareto-optimal allocations. As an important tool, the pricing mechanism has been applied in the 
distributed optimization of various communication networks [12]. However, we would like to point out 
that, the pricing mechanism generally requires repeated coordination information exchange among users 
in order to determine the optimal actions and achieve the Pareto-optimality. In contrast, for the linear 




(44) 
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coupled communication games, since the specific structure of the utility function is explored, the CE 
approach is able to calculate the Pareto efficient operating point in a distributed manner, without any real- 
time information exchange among users. In fact, the underlying coordination is implicitly implemented 
when the participating users initialize their belief parameters. Once the belief parameters are properly 
initialized by the protocol according to (l42l ). using the proposed dynamic update algorithms, individual 
users are able to achieve the Pareto-optimal CE solely based on their individual local observations on 
their states and no message exchange is needed during the convergence process. Therefore, the conjecture 
equilibrium approach is an important alternative to the pricing-based approach in the linearly coupled 
games. 

V. Conclusion 

We derive the structure of the utility functions in the multi-user communication scenarios where a user's 
action has proportionally the same impact over other users' utilities. The performance gap between NE 
and Pareto boundary of the utility region is explicitly characterized. To improve the performance in 
non-cooperative cases, we investigate a CE approach which endows users with simple linear beliefs 
which enables them to select an equiUbrium outcome that is efficient without the need of explicit 
message exchanges. The properties of the CE under both the best response and Jacobi dynamic update 
mechanisms are characterized. We show that the entire Pareto boundary in linearly coupled games is 
globally convergent CE which can be achieved by both studied dynamic algorithms without the need 
of real-time message passing. A potential future direction is to see how to extend the CE approach to 
the general linearly coupled games that are compositions of the basic two types and certain particular 
non-linearly coupled multi-user communication scenarios. 
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TABLE I 

Actions and payoffs at NE and Pareto boundary. 





User 1 


User 2 


User 3 




1.25 


0.625 


0.25 




3.4939 


1.5625 


1.25 




0.833 


0.417 


0.167 




3.8036 


2.0833 


2.0412 



TABLE II 

Comparison between Type I and Type II games. 



Games 


Best response dynamics 


Stability vs. efficiency 


Fairness vs. parameter selection 


Type I 


local stability <^ global convergence 


stable at near-Pareto-optimal points 


tin OC T„/Xn 


Type II 


local stability global convergence 


stable at the Pareto boundary 


iOn = rn/\n at the Pareto boundary 



